Evaluating the robustness of source code plagiarism detection tools to pervasive plagiarism-hiding modifications

Abstract

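Source code plagiarism is a common occurrence in undergraduate computer science education. In order to identify such cases, many source code plagiarism detection tools have been proposed. A source code plagiarism detection tool evaluates pairs of assignment submissions to detect indications of plagiarism. However, a plagiarising student will commonly apply plagiarism-hiding modifications to source code in an attempt to evade detection. Subsequently, prior work has implied that currently available source code plagiarism detection tools are not robust to the application of pervasive plagiarism-hiding modifications. In this article, 11 source code plagiarism detection tools are evaluated for robustness against plagiarism-hiding modifications. The tools are evaluated with data sets of simulated undergraduate plagiarism, constructed with source code modifications representative of undergraduate students. The results of the performed evaluations indicate that currently available source code plagiarism detection tools are not robust against modifications which apply fine-grained transformations to the source code structure. Of the evaluated tools, JPlag and Plaggie demonstrate the greatest robustness to different types of plagiarism-hiding modifications. However, the results also indicate that graph-based tools, specifically those that compare programs as program dependence graphs, show potentially greater robustness to pervasive plagiarism-hiding modifications.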
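To make the idea of pairwise comparison concrete, the following is a minimal sketch of a token-based similarity check. It is not the algorithm of JPlag, Plaggie, or any other tool evaluated in the article; the keyword list, the `ID`/`NUM` placeholders, the k-gram size, and the function names are all assumptions chosen for illustration. It shows why renaming identifiers alone tends not to hide copying from a tool that normalises tokens, while a small structural change (here, rewriting a `for` loop as a `while` loop) already lowers the score.

```python
import re

# Hypothetical, deliberately tiny keyword list for a Java-like language.
KEYWORDS = {"int", "return", "for", "if", "else", "while"}

# Identifiers/keywords, integer literals, or single punctuation characters.
TOKEN_RE = re.compile(r"[A-Za-z_]\w*|\d+|[^\s\w]")


def normalise(source):
    """Tokenise source and collapse identifiers and numbers to placeholders."""
    tokens = []
    for tok in TOKEN_RE.findall(source):
        if tok in KEYWORDS:
            tokens.append(tok)
        elif tok[0].isalpha() or tok[0] == "_":
            tokens.append("ID")
        elif tok.isdigit():
            tokens.append("NUM")
        else:
            tokens.append(tok)
    return tokens


def kgrams(tokens, k=4):
    """Return the set of all contiguous k-token windows."""
    return {tuple(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}


def similarity(a, b, k=4):
    """Jaccard similarity of the normalised k-gram sets of two programs."""
    ga, gb = kgrams(normalise(a), k), kgrams(normalise(b), k)
    if not ga and not gb:
        return 1.0
    return len(ga & gb) / len(ga | gb)


original = ("int sum(int n) { int s = 0; "
            "for (int i = 0; i < n; i++) { s = s + i; } return s; }")
# Same program with every identifier renamed.
renamed = ("int total(int limit) { int acc = 0; "
           "for (int k = 0; k < limit; k++) { acc = acc + k; } return acc; }")
# Renamed and with the for loop rewritten as a while loop.
restructured = ("int total(int limit) { int acc = 0; int k = 0; "
                "while (k < limit) { acc = acc + k; k++; } return acc; }")

print(similarity(original, renamed))       # identical after normalisation
print(similarity(original, restructured))  # lower: the token structure changed
```

Under these assumptions, the renamed copy is indistinguishable from the original, whereas the restructured copy shares only part of its k-grams with it. Real tools use more elaborate matching, so this sketch only illustrates the general sensitivity of token sequences to structural edits.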

Similar resources

Evaluating the Performance of LSA for Source-code Plagiarism Detection

Latent Semantic Analysis (LSA) is an intelligent information retrieval technique that uses mathematical algorithms for analyzing large corpora of text and revealing the underlying semantic information of documents. LSA is a highly parameterized statistical method, and its effectiveness is driven by the setting of its parameters which are adjusted based on the task to which it is applied. This p...

Normalization based Stop-Word approach to Source Code Plagiarism Detection

This paper is a report of PES Institute of Technology’s participation in the Cross Language Detection of Source Code Reuse (CL-SOCO) task at FIRE 2015 [1]. We approach this task as a text document plagiarism task, without considering the formal grammatical structure of the programming language. We use normalization of commonly used identifiers to detect pairs of programs which have the same objective. We als...

Evaluating Robustness for 'IPCRESS': Surrey's Text Alignment for Plagiarism Detection

This paper briefly describes the approach taken to the subtask of Text Alignment in the Plagiarism Detection track at PAN 14. We have now reimplemented our PAN12 approach in a consistent programmatic manner, courtesy of secured research funding. PAN 14 offers us the first opportunity to evaluate the performance/consistency of this re-implementation. We present results from this re-implementatio...

Plagiarism Detection in Computer Code

The ease with which computer code can be copied by students presents opportunities for plagiarism on programming projects. Presently, instructors are provided with few tools (if any) that assist them in detecting possible plagiarism in an assignment. Therefore, the only solution is a tedious and error-prone manual check of student submissions, looking for similarity between any two. In this th...

Journal

Journal title: Empirical Software Engineering

Year: 2021

ISSN: 1382-3256, 1573-7616

DOI: https://doi.org/10.1007/s10664-021-09990-4